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Abstract 



We present BEE, a compiler which enables to encode finite domain constraint problems to 
\ CNF. Using BEE both eases the encoding process for the user and also performs transfor- 

mations to simplify constraints and optimize their encoding to CNF. These optimizations 
are based primarily on equi-propagation and on partial evaluation, and also on the idea 
that a given constraint may have various possible CNF encodings. Often, the better encod- 
ing choice is made after constraint simplification. BEE is written in Prolog and integrates 
directly with a SAT solver through a suitable Prolog interface. We demonstrate that con- 
straint simplification is often highly beneficial when solving hard finite domain constraint 
■ problems. A BEE implementation is available with this paper. 
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1 Introduction 

In recent years, Boolean SAT solving techniques have improved dramatically. To- 
day's SAT solvers are considerably faster and able to manage larger instances than 
yesterday's. Moreover, encoding and modeling techniques are better understood and 
\ increasingly innovative. SAT is currently applied to solve a wide variety of hard and 

practical combinatorial problems, often outperforming dedicated algorithms. The 
general idea is to encode a (typically, NP) hard problem instance, /i, to a Boolean 
formula, tp^, such that the solutions of // correspond to the satisfying assignments 
of tp^. Given the encoding, a SAT solver is then applied to solve /z. 

Tailgating the success of SAT technology are a variety of tools which can be 
applied to specify and then compile problem instances to corresponding SAT in- 



stances. For example, Cadoli and Schaerf (2005) introduce NP-SPEC, a logic-based 



specification language which allows to specify combinatorial problems in a declar- 
ative way. At the core of this system is a compiler which translates specifications 
to CNF formula. The general objective of such tools is to facilitate the process of 
providing high-level descriptions of how the (constraint) problem at hand is to be 
solved. Typically, a constraint based modeling language is introduced and used to 
model instances. Drawing on the analogy to programming languages, given such a 
description, a compiler then provides a low-level executable for the underlying ma- 
chine. Namely, in our context, a formula for the underlying SAT or SMT solver. One 



obstacle when seeking to optimize CNF encodings derived from high-level descrip- 
tions, is that CNF encodings are "bit-level" representations and do not maintain 
"word-level" information. For example, from a CNF encoding one cannot know that 
certain bits originate from the same integer value in the original constraint. This 
limits the ability to apply optimizations which rely on such word-level information. 

We mention two relevant tools. Sugar (jTamura et al. 2009|) . is a SAT-based con- 
straint solver. To solve a finite domain linear constraint satisfaction problem it is 
first encoded to a CNF formula by Sugar, and then solved using the MiniSat solver 
(jEen and S orensso n 2003)) . BEE is like Sugar, but applies optimizations. Sugar is 
the first system which demonstrates the advantage in adopting the, so-called, unary 
order-encoding to represent integers. We follow suite, and introduce additional novel 
encoding techniques that take advantage of, previously unobserved, properties of 
the order-encoding. MiniZinc (|Nethercote et al. 2007|) . is a constraint modeling lan- 
guage which is compiled by a variety of solvers to the low-level target language 
FlatZinc for which there exist many solvers. It creates a standard for the source 
language (which we follow loosely). BEE is like FlatZinc, but with a focus on a 
subset of the language relevant for finite domain constraint problems. 

We present a tool, BEE (Ben-Gurion University Equi-propagation Encoder) which 
translates models in a constraint based modeling language, similar to Sugar and 
FlatZinc, to CNF. Conceptually, BEE maintains two representations for each con- 
straint in a model so that each constraint is also viewed as a Boolean function. 
Partial evaluation, and other word-level techniques, drive simplification through 
the constraint part; whereas, equi-propagation (jMetodi et al. 20TT|) . and other bit- 
level techniques, drive simplification through the Boolean part. Finally, an encoding 
technique is selected for a constraint, depending on its context, to derive a CNF. 

The name, "BEE" refers both to the constraint language as well as to its compiler 
to CNF. BEE is not a constraint solver, but can be applied in combination with a 
SAT solver to solve finite domain constraint problems. We report on our experience 
with applications which indicates that using BEE, like any compiler, has two main 
advantages. On the one hand, it facilitates the process of programming (or model- 
ing). On the other hand, given a program (a model), it simplifies the corresponding 
CNF which, in many cases, is faster to solve than with other approaches. The tool 
integrates with SWI Prolog and can be downloaded from (jMetodi 2012[) . 



2 Representing Integers 

A fundamental design choice when encoding finite domain constraints concerns 



the representation of integer variables. Gavanelli (2007) surveys several of the pos 



sible choices (the direct-, support- and log- encodings) and introduces the log- 
support encoding. We focus in this paper on the use of unary representations and 
primarily on the, so-called, order- encoding (see e.g. ([Crawford and Ba ker 1994: 
IBailleux~a nd Boufkhad 2003)) which has many nice properties when applied to 
small finite domains. We describe the setting where all integer variables are rep- 
resented in the order-encoding except for those involved in a global "all-different" 
constraint which take a dual representation with channeling between the ordcr- 
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encoding and the direct encoding. This choice derives from the observation by 



Ansotegui et al. (2004) that the direct-encoding is superior when encoding the all- 



different constraint. 

Let bit vector X = [x±, . . . , x n ] represent a finite domain integer variable. In the 
order-encoding, X constitutes a monotonic non-increasing Boolean sequence. Bit Xi 
is interpreted as X > i. For example, the value 3 in the interval [0, 5] is represented 
in 5 bits as [1, 1, 1, 0, 0]. In the direct- encoding, X constitutes a characteristic func- 
tion (exactly one bit takes value 1) and Xi is interpreted as stating X = i — 1. For 
example, the value 3 in the interval [0, 5] is represented in 6 bits as [0, 0, 0, 1, 0, 0]. 

An important property of a Boolean representation for finite domain integers 
is the ability to represent changes in the set of values a variable can take. It is 
well-known that the order-encoding facilitates the propagation of bounds. Consider 
an integer variable X = [x\ , . . . , x n ] with values in the interval [0,n]. To restrict 
X to take values in the range [a, b] (for 1 < a < b < n), it is sufficient to assign 
x a = 1 and Xb+i = (if b < n). The variables x a < and x& for > a' > o and 
b < b' < n are then determined true and false, respectively, by unit propagation. 
For example, given X = [x±, . . . ,Xg], assigning X3 = 1 and xq = propagates to 
give X = [1,1,1, X4, X5, 0, 0, 0, 0], signifying that dom(X) = {3, 4, 5}. This property 
is exploited in Sugar (jTamura et al. 2009| which also applies the order-encoding. 

We observe, and apply in BEE, an additional property of the order-encoding: 
its ability to specify that a variable cannot take a specific value < v < n in 
its domain by equating two variables: x v = x v +\. This indicates that the order- 
encoding is well-suited not only to propagate lower and upper bounds, but also to 
represent integer variables with an arbitrary, finite set, domain. For example, given 
X = [x\, . . . ,xg], equating X2 — £3 imposes that X ^ 2. Likewise £5 = x$ and 
xj = x$ impose that X ^ 5 and X ^ 7. Applying these equalities to X gives, 
X = [x\, X2, X2, £4, X5, X5, X7, xt, xg], signifying that dom(X) = {0, 1, 3, 4, 6, 8, 9}. 

The order-encoding has many additional nice features that are exploited in BEE 
to simplify constraints and their encodings to CNF. To illustrate one, consider a con- 
straint of the form A + B = 5 where A and B are integer values in the range between 
and 5 represented in the order-encoding. At the bit level we have: A = [ai, . . . , a 5 ] 
and B = [bi, . . . ,b 5 ]. The constraint is satisfied precisely when B = [^a 5 , . . . , ->ai]. 
Instead of encoding the constraint to CNF, we substitute the bits bi, . . . , b 5 by the 
literals -ia 5 , . . . , -iai, and remove the constraint. In Prolog, this is implemented as 
a unification and does not generate any clauses in the encoding. 



3 Constraints in BEE 

Boolean constants "irwe" and " false" are viewed as (integer) values "1" and "0" . 
Constraints are represented as (a list of) Prolog terms. Boolean and integer variables 
are represented as Prolog variables, which may be instantiated when simplifying 
constraints. Table Q] introduces the syntax for (a simplified subset of) BEE. In the 
table, X and Xs (possibly with subscripts) denote a literal (a Boolean variable or 
its negation) and a vector of literals, I (possibly with subscript) denotes an integer 
variable, and c (possibly with subscript) denotes an integer constant. 
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Declaring Variables 


(1) new_bool(X) 




declare Boolean X 


(2) new_mt(I, Ci, c 2 J 




declare integer I, ci < I < c 2 


(3) ordered([Xi, . . . ,X n ]j 




Xi > X 2 > ■ ■ ■ > X n (on Booleans) 


Boolean (reined) Statements 




op G {or, and, xor, iff} 


(4) bool_eq(Xi,X 2 ) or bool_eq(Xi 


-x 2 ) 


Xi = X 2 or X! = -nX 2 


(5) bool_array_op([Xi, . . . , X n ]) 




Xi op X2 ■ ■ ■ op X n 


(6) boo] — array _op_reif ( [Xi , . . . , X n 


y\ 

li X J 


Al Op A2 ■ ■ ■ Op A n <^=? A 


[( J DOO_L_Op_reiI 1 Al , A2 5 A! 




Y r,r\ Y <<-K Y 
Al Op A2 V 1 ? A 


(8) bool_array_lex(Xsi , Xs 2 ) 




Xsi precedes Xs 2 in the lex order 


Integer relations (reined) 




rel G {leq, geq, eq, It, gt, neq} 


and arithmetic op G {plus, 


times. 


div, mod, max, min}, op' G {plus, max, min} 


(9) intjrel(li,I 2 ) 




Ii rel I 2 


(10) int_rel_reif (Ii, I 2 , X) 




Ii rel I 2 <S> X 


(11) int_op(li,I 2 , I) 




Ii op I2 = I 


(12) iat_array_op'([li,...,lJ, I) 




II Op' • • • Op' I n = I 


All Different and cardinality 




relG{leq, geq, eq, It, gt, neq} 


(13) allDiff([l 1 ,...,I n ]) 




Akj ii + ij 


(14) bool_array_sum_rel([Xi , . . . , X n 


1, I) 


(E Xi) rel I 


(15) comparator(Xi , X2, X3, X4) 




sort([Xi,X 2 ]) = [X 3 ,X 4 ] 



Table 1. Syntax for a subset of BEE. 



On the right column of the table are brief explanations regarding the constraints. 
The table introduces 15 constraint templates. Constraints (1-2) are about variable 
declarations: Booleans and integers. Constraint (3) signifies that a bit sequence is 
monotonic non-increasing, and is used to specify that an integer variable is in the 
order-encoding. Constraints (4-7) are about Boolean (and reified Boolean) state- 
ments. The cases for bool_array_or([X l5 . . . , Xj) and bool_array_xor([Xi, . . . , X n ]) 
facilitate the specification of clauses and of xor clauses (supported in the Cryp- 
toMiniSAT solver (jSoos 20l0| ). Constraint (8) specifies that two bit-vectors are 
ordered lexicographically. Constraints (9-12) are about integer relations and op- 
erations. Constraints (13-14) are the all-different constraint on integers and the 
cardinality constraint on Booleans. Constraint (15) specifics that sorting a bit 
pair [Xi,X 2 ] (decreasing order) results in the pair [X 3 ,X 4 ]. This is a basic build- 
ing block for the construction of sorting networks (jBatcher 1968|) used to encode 
cardinality constraints during compilation as described in (jAsin et al. 201 lj) and 
in QCodish and Zazon-Ivry 2010[ ). 
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:- use_module(bee_compiler , [compile/2]). 
:- use_module(sat_solver , [sat/1]). 

solve (Instance , Solution) :- 

encode (Instance , Map, Constraints), 
compile (Constraints , CNF), 
sat (CNF) , 

decode (Map, Solution). 
Fig. 1. A generic application of BEE. 



4 An Example BEE Application: magic graph labeling 

We illustrate the application of BEE to solve a graph labeling problem. A typical 
BEE application has the form depicted as Figure [T] where the predicate solve/2 
takes a problem Instance and provides a Solution. The specifics of the application 
are in the call to encode/3 which given the Instance generates the Constraints 
that solve it together with a Map relating instance variables with constraint vari- 
ables. The calls to compile /2 and sat/1 compile the constraints to a CNF and solve 
it applying a SAT solver. If the instance has a solution, the SAT solver binds the 
constraint variables accordingly. Then, the call to decode/2, using the Map, pro- 
vides a Solution in terms of the instance variables. The definitions of encode/3 
and decode/3 are application dependent and provided by the user. The predicates 
compile/2 and sat/1 provide the interface to BEE and the underlying SAT solver. 

Graph labeling is about finding an assignment of integers to the vertices and 
edges of a graph subject to certain conditions. Graph labelings were introduced in 
the 60's and hundreds of papers on a wide variety of related problems have been 



published since then. See for example the survey by Gallian (2011 ) with more than 
1200 references. Graph labelings have many applications. For instance in radars, 
xray crystallography, coding theory, etc. 

We focus here on the vertex-magic total labeling (VMTL) problem where one 
should find for the graph G = (V, E) a labeling that is a one-to-one map V D E ^ 
{1,2, . . . ,\V\ + \E\} with the property that the sum of the labels of a vertex and 
its incident edges is a constant K independent of the choice of vertex. A problem 
instance takes the form vmtl(G,K) specifying the graph G and a constant K. 
The query solve(vmtl(G, K), Solution) poses the question: "Does there exist a 
vmtl labeling for G with magic constant KT' It binds Solution to indicate such 
a labeling if one exists, or to "unsat" otherwise. Figure [2] illustrates an example 
problem instance together with the constraints, Cs and the map, M, generated by 
the encode/3 predicate for this instance. The constraints introduce integer variables 
for the vertices and edges, specify that these variables take "all different" values, 
and specify that the labels for each vertex with its incident edges sum to K. Solving 
the constraints from Figure [2] for the example VMTL instance binds the Map, M, 
as follows, indicating a solution: 



M = 



((1,2), [1,1,1,1,1,1,1,0]), (1, [1,1,1,1,0,0,0,0]), 

((1,3), [1,1,1,0,0,0,0,0]), (2, [1,1,1,1,1,0,0,0]), 

((2,3), [1,1,0,0,0,0,0,0]), (3, [1,0,0,0,0,0,0,0]), 

((3,4), [1,1,1,1,1,1,1,1]), (4, [1,1,1,1,1,1,0,0]) 
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An Instance 



The Graph 



The Map 



((1,2), Ei), (1, Vi), 

((1,3), E 2 ), (2, V a ), 

((2,3), E 3 ), (3, V 3 ), 

((3,4), E 4 ), (4, V 4 ) 



Instance = vmtl(G,K), 
G= (V,E), 
V= [1,2,3,4], 
E= [(1,2), (1,3), 
(2,3), (3,4)], 
K = 14 



The Constraints 



new_int(Vi, 1,8), 
new_int(V2, 1,8), 
new_int(V 3 , 1, 8), 
new_int(V4, 1,8), 
new_int(K, 14, 14) 



new_int(Ei, 1,8), int_array_plus([Vi , Ei, E2], K), 
new_int(E2, 1,8), int_array_plus([V2, Ei, E3], K), 
new_int(E3, 1, 8), int_array_plus([V3, E2, E3, E4], K) 
new_int(E4, 1, 8), int_array_plus([V4, E4], K), 
allDiff([V 1 ,V 2 ,V3,V4,E 1 ,E 2 ,E 3 ,E 4 ]) 



Fig. 2. A VMTL instance with the constraints and map generated by encode/3. 

In Section [9] we report that using BEE enables us to solve interesting instances of 
the VMTL problem not previously solvable by other techniques. 

5 Compiling BEE to CNF 

The compilation of a constraint model to a CNF using BEE goes through three 
phases. In the first phase, (unary) bit blasting, integer variables (and constants) 
are represented as bit vectors in the order-encoding. Now all constraints are about 
Boolean variables. The second phase, the main loop of the compiler, is about con- 
straint simplification. Three types of actions are applied: equi-propagation, partial 
evaluation, and decomposition of constraints. These are specified as a set of transi- 

Q 

tions which we write in the form c\ 1 — > C2 to specify that constraint c\ reduces to 
constraint C2 generating the (possibly empty) substitution 6. Simplification is ap- 
plied repeatedly until no rule is applicable. In the third, and final phase, simplified 
constraints are encoded to CNF. We elaborate below. To simplify the presenta- 
tion, we assume that integer variables are represented in a positive interval starting 
from 0. As later detailed in Section |8] there is no such limitation in BEE. 

Bit-blasting: Each integer variable declaration new_int(l, ci, C2) triggers a unifica- 
tion I = [1, . . . , 1, X Cl+1 , . . . , X C2 ] and introduces a constraint ordered(l) to specify 
that the bits representing I arc in the order-encoding. To illustrate bit-blasting, 
consider again the VMTL example detailed in Figure EJ Each variable in the Map 
occurs in a new_int declaration. So the following unifications are performed: 

Vi = [1, Vi,2, Vi,3, Vi,4, Vi,5, Vi, 6 , Vi,7, Vi, 8 ], Ei = [1, Ei, 2, Ei, 3, Ei,4, Ei, 5, Ei, 6, Ei, 7, Ei, 8 ], 

V2 — [l, V2,2, V2,3, V2,4, V2,5, V2,6, V2,7, V2,s], E2 = [l , E2,2 , E2,3 , E2,4 , E2,5 , E2,6 , E2,7 , E2,s] , 
V3 = [1, V3, 2 , V3,3, V 3 ,4, V3,5, V3,6, V3,7, V3,8], E3 = [l , E 3 ,2 , E3,3 , E3,4 , £3,5 , E3,6 , E3,7 , E3,e] , 
V 4 = [1, V 4 ,2, V4,3,V4,4, V4,6, V4,6,V4,7, V 4 ,8], E4 = [l, E4,2 , E4,3 , E4,4, E4,5 , E4,6 , E4,7, E4,s] , 

K= [1,1,1,1,1,1,1,1,1,1,1,1,1,1] 
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Integer variables occurring in an allDif f constraint are bit-blasted twice: first, in 
the order-encoding, when declared, as explained above, and second, in the direct 
encoding, when processing the allDif f constraint, as described below. 

Equi-propagation is about detecting situations in which a small number of con- 
straints imply an equality of the form X = L where X is a Boolean variable and L is 
a Boolean literal or constant. In this case X becomes redundant and can be replaced 
by L in all constraints. In BEE we consider as candidates for equi-propagation, indi- 
vidual constraints together with constraints specifying that their integer variables 
are in the order-encoding. If X = L is such an equality, then equi-propagation 
is implemented by unifying X and L. This unification applies to all occurrences 
of X and in this sense "propagates" to other constraints involving X. Once equi- 
propagation detects such an equation, this may trigger further equi-propagation 
from other constraints. For example, consider the constraint int_neq(li, I 2 ) where 
Ii = [xi, x 2 , x 3 , x 4 ] and I 2 = [1, 1,0,0]. We propagate that (x 2 = x 3 ) because 

Ii = [xi,x 2 ,X3,X4] A I a = [1,1,0,0] A 
int_neq(li, I 2 ) A ordered(li) 

To see why, consider that ordered(li) implies that x 2 > x 3 . Furthermore, also 
x 2 < x 3 as otherwise x 2 = 1 and x 3 = which implies that Ii = [1, 1,0,0], contra- 
dicting int_neq(li, I 2 ). 

In BEE, equi-propagation is implemented by a collection of ad- hoc transition 
rules for each type of constraint. While this approach is not complete — there are 
equations implied by a constraint that BEE will not detect — the implementation 
is fast, and works well in practice. An alternative approach is to implement equi- 
propagation, using BDD's, as described in (IMetodi et al. 20lT|) . This approach, 
though complete, is slower and not included in the current release of BEE. 

The following are two of the simplification (equi-propagation) rules of BEE that 
apply to int_neq constraints: 

neqi : applies when one of the (order-encoding) integers in the relation is a constant 
and 9 = {X 1 = X 2 }: 

. . ( [■ ■ ■ ,X ll X 2 ,. ■ ■} \ 6 . ( [...,X U X X , 

mt_aeq . . i — > mt_neq 



H ( X 2 = x 3 ). 



i, o, ...] J "V [•••> i, o, 

neq 2 : applies when the integers share common variables as in the rule template 
and 9 = {Xt = X 2 }: 

( [•••, X u X 2 , ...], \ e . / [..., X u X u 

mt_rieq . . i — > mt_neq [ 



X 2 , ->Xx ,...]/ V I ■ ■ • ' ~^X\ , -ill , ■ • ■ ] 

For the rule neqi, observe that after applying this rule the constraint obtained is 
a tautology. Hence it is subsequently removed by one of the other "partial evalua- 
tion" rules. For the rule neq 2 , to see why the equation X\ = X 2 is implied by the 
constraint (on the left side of the rule), consider all possible truth values for the vari- 
ables X\ and X 2 : (a) If X\ = and X 2 = 1 then both integers in the relation take 
the form [. . . , 0, 1, . . .] violating their specification as ordered, so this is not possi- 
ble, (b) If Xi — 1 and X 2 = then both numbers take the form [1, . . . , 1, 0, . . . , 0] 
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and are equal, violating the neq constraint. The only possible bindings for X\ and 
X2 are those where X\ = X^. The template expressed in rule neq 2 is not contrived. 
It comes up frequently as a result of applying other equi-propagation rules. 

Partial evaluation is about simplifying constraints in view of variables that are 
(partially) instantiated, either because of information from the constraint model 
or else due to equi-propagation. Typical cases include constant elimination and 
elimination of tautologies. The following are some of BEE's partial evaluation rules 
that apply to int_neq constraints (e denotes the empty substitution). 

neq 3 : applies to remove replicated variables: 

neq 4 : applies to remove leading 1 bits (there is a similar rule for trailing O's): 

int_neq([l, 1,X 3 , . . .], [Y^Y^Ys, . . .]) ^ int_Qeq([l, X 3 , . . .], [Y 2 ,Y 3 , . . .]) 

We now detail three of the simplification rules (equi-propagation and partial 
evaluation) that apply to a constraint of the form int_plus(A,B,C) where we 
assume for simplicity of presentation (the tool supports the general case) that 
A = [Ai, . . . , An], B = [Bi, . . . , B m ], and C = [C 1; . . . , C n+ J. We denote by min(l) (or 
max(l)) the minimal (or maximal) value that integer variable I can take, determined 
by the number of leading ones (or trailing zeros) in its bit representation. 

Rule plusi is standard propagation for interval arithmetics. Rule plus 2 removes 
redundant bits (assigned values through plusi). Rules plus 3 ( a ) and plus 3 ( b ) remove 
constraints and may seem contrived: 3(a) assumes that m = and 3(b) assumes 
that n = m and that C represents the (same) constant n. However, in the general case, 
when n, m are arbitrary and constant C is represented in m + n bits, then application 
of the other rules will reduce the constraint to one of these special cases. 

plusi : applies to propagate bounds: int_plus(A, B, C) h-^->- int_plus(A, B, C) where 

Cmax{min(C) ,min(A)+min(B)} 1 ) ^min{max(C) .7nax(A)-\-max(B)} + l 0? 
Amax{min(A) ,min(C)— max(B)} 1 : J ^-min{max(A)^max(C)—min(B)}~\-l 
^max{min(B) ,min(C) — max(A)} 1 ? ^min{max(B) ^max(C) —min(A)}-]-l 

plus 2 : applies to remove leading l's (there are similar rules for trailing O's and for 
the case when the l's or O's are on [Bj., . . . ,Bj): 

[A 2 ,...,A n ], 





int_plus ( [B 1; ...,B m ], | h— >• int.plus ( [B l7 ...,B m ], 

plus 3 ( a ) : applies when A or B is the empty bit list and 9 = { C, = Ai\ 1 < i < n } 

int_plus([A!, . . . , A n ], [ ], [Ci, . . . ,C n ]) i— >■ none 
plus 3 ( b ) : applies when C is a constant n and 8 = { Ai — -iB n —i+i | 1 < i < n } 

int_plus([Ai, . . . , A n ], [Bi, . . . , B n ], [1, . . . 1, 0, . . . , 0]) t- 9 ^ none 
We illustrate the simplification of a int_plus constraint by the following example. 
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Example 1 [simplifying intjplus: equi- propagation and partial evaluation) 
Consider constraint int_plus(A, B, C) where A and B are integer variables with do- 
main [1..8] and C is the constant 14 represented in 16 bits. Constraint simplification 
follows the steps: 



int_plus( 

[1, A 2 ,A3,A4,A 5 ,A 6 ,A 7 ,A 8 ], 
[1, B 2 , B 3 , B 4 , B 5 , B 6 , B 7 , B 8 ], 

1, ,1,0,0] 



) 



plusi 



int_plus( 




[1,1,1,1,1, 


1,A 7 ,A 8 ], 


[1,1,1,1,1, 


1,B 7 ,B 8 ], 


[1, 


,1,0,0] 


14 times 




) 





plUS2 



pll!S2 



int_plus 



[A 7 ,A 8 ], [B 7 ,B 8 

[1,1,0,0] 



plus 3(b) 



none, binding : 

B 7 = ^A 8 , B 8 = ^A 7 ) 



After constraint simplification variables A and B take the form: [1, 1, 1, 1, 1, 1, A 7 , A 8 ] 
and [1,1,1,1,1,1, -iA 8 , -iA 7 ] (and nothing is left to encode to CNF). 

Decomposition is about replacing complex constraints (for example about arrays) 
with simpler constraints (for example about array elements). Consider, for instance, 
the constraint int_array_plus(As, Sum). It is decomposed to a list of int_plus 
constraints applying a straightforward divide and conquer recursive definition. At 
the base case, if As=[A] then the constraint is replaced by int_eq(A , Sum) , or if 
As = [Ai,A 2 ] then it is replaced by int_plus(Ai, A 2 , Sum). In the general case As is 
split into two halves, then constraints are generated to sum these halves, and then 
an additional int_plus constraint is introduced to sum the two sums. 

As another example, consider the int_plus(Ai, A 2 , A) constraint. One approach, 
supported by BEE, decomposes the constraint as an odd-even merger (from the con- 
text of odd-even sorting networks) ([Batcher 1968j) . Here, the sorted sequences of bits 
Ai and A 2 are merged to obtain their sum A. This results in a model with O(nlogn) 
comparator constraints (and later in an encoding with O(nlogn) clauses). An- 
other approach, also supported in BEE, does not decompose the constraint but 
encodes it directly to a CNF of size 0(n 2 ), as in the context of so-called total- 
izers (iBaillcux and Boufkhad 2003|l . A hybrid approach, leaves the choice to BEE, 
depending on the size of the domains of the variables involved. Finally, we note that 
the user can configure BEE to fix the way it compiles this constraint (and others). 



CNF encoding is the last phase and applies to all remaining simplified constraints. 
The encoding of constraints to CNF is standard and similar to the encodings in 
Sugar (jTamura et al. 2009[) . 

Cardinality constraints are about the cardinality of sets of Boolean variables and 
are specified by the template bool_array_sum_rel([Xi, . . . , X n ], I). Cardinality con- 
straints are normalized, see e.g., (jEen and Sorensson 2006|) . so we only consider 
rel £ {leq, eq}. Partial evaluation rules for cardinality constraints are the obvi- 
ous. For example, in the special case when I is a constant: 
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bool_array_sum_leq([Xi, X 2 , 1, X 4 ], 3) t— > bool_array_sum_leq([Xi, X 2 , X 4 ], 2) 
bool_array_sum_leq([Xi, X 2 , 0, X 4 ], 3) H- bool_array_sum_leq([Xi, X 2 , X 4 ], 3) 
bool_array_sum_leq([Xi, X 2 , — Xi, X 4 ], 3) i— > bool_array_sum_leq([X 2 , X 4 ], 2) 

The special case, when I is the constant 1 is called the "at-most-one" constraint and 
it has been studied extensively (for a recent survey see (jFrisch and Giann aros 2010[) ) . 
In BEE, we support two different encodings for this case (the user can choose). The 
first is the standard "pairwise" encoding which specifies a clause (^Xi V -*Xj) for 
each pair of Boolean variables Xi and Xj. This encoding introduces 0(n 2 ) clauses 
and is sometimes too large. The second, is a more compact encoding which fol- 
lows the approach described in (Chen 2010). In the general case (when I > 1) the 
constraint is decomposed, much the same as the int_array_plus constraint, to a 
network of int_plus constraints. 



The All-different constraint specifics that a set of integer variables take all different 
values. Although wc adopt the order-encoding for integer variables, it is well ac- 



cepted that for these constraints the direct encoding is superior ( Ansotegui et aL 2004 ) 
For this reason, in BEE, when processing the constraint, a dual representation is 
chosen. When integer variable I, occurring in an allDiff constraint, is declared, it 
was unified with its unary representation in the order-encoding: I = [x 1; . . . , x n ]. In 
addition, wc associate I with a new bit-blast, [d , . . . , d n ], in the direct encoding. 
We introduce for each such I a channeling formula to capture the relation between 
its two representations. 



n-1 



channel([x 1 ,...,x n ],[d ,...,d n ]) = ( ^° _ Xl ) A A (d± -O- x ± A -■x i+1 ) 

\ A u n — X n J ^ _^ 

During constraint simplification, the allDif f([li, I n ]) constraint is viewed 
as a bit matrix where each row consists of the bits [d i0 , . . . , d im ] for Ii in the direct 
encoding. The clement d^ is true iff Ii takes the value j. The j th column specifics 
which of the Ii take the value j and hence, at most one variable in a column may 
take the value true. BEE distinguishes the special case when . . . , I n ] must take 
precisely n different values. In this case the constraint is about "permutation" . Wc 
denote this by a flag (*) as in allDif f *([li, . . . , I n ]). In this case, exactly one bit 
in each column of the representation must take the value true. 

To simplify an allDif f constraint, BEE applies simplification rules to the implicit 
cardinality constraints on the columns and also two specific allDif f rules. The first 



is essentially the usual domain consistent propagator (Rcgin 1994) focusing on Hall 
sets of size 2. The second rule applies only to an allDif f * constraint which is about 
permutation. Wc denote the values that Ii can take as dom(li). 



allDif fi : when dom(li) = dom(l 2 ) = {v!,v 2 }: 

allDif f ([Ii, I 2 , 1 3 , . . . , I n ]) "A allDif f([l 3 , . . . , Ij) 

where 6 = U3<i< n {^i,fi = °j^> 2 = 0} U {d 1>vi = —d 2 , Vl ,di iV2 = —<h,v a }. 
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allDif f 2 : when {vi, v 2 } C dom(li) n dom(l 2 ), and for i > 3, {vi, v 2 } fl dom(li) = 
allDiff *([I U Ij) A allDif f . . . , I„]) 
where = \J^ Vl ^ V2 {d l3 = 0,d 2 ,j = 0}. 
To illustrate the two rules for allDiff consider the following. 

Example 2 

Consider an allDiff constraint on 5 integer variables taking values in the interval 
[0,7] where the first two can take only values and 1. So, they are a Hall set 
of size two and rule allDiff i applies. We present the simplification step on the 
order encoding representation (though it is triggered through the direct encoding 
representation) : 



allDiff 



/ [*L,1: 

[*S,1. 
[-X4,l : 
V [*5,l : 



0. 

o, 

X-4,2; 
X$ 2; 



where 



n A2.i , X- 



0] 
0] 

^3,7] 

^4,7] 
x 5 M 



J 



allDiff 




,Xs,r] 
,^5,7] 



; 1 l = J Jf 4l l=0 J Jf Bl l = 0}. 

Now consider a setting where an allDiff constraint is about 5 variables that 
can take 5 values (permutation) and the first two are the only two that can take 
values and 1. So rule allDif f 2 applies. We present the simplification step on the 
order encoding representation (though it is triggered through the direct encoding 
representation) : 



allDiff* 



f [-Xl,l, -Xl,2, -Xl,3, -Xl,4] \ 
[X2.I , X2,2, X2.Z, X2,i] 
[1, 1, X3, 3,^3,4] 

[1, 1, ^4,3,^4,4] 

\ [i, 1, x 5:S ,x 5A ] J 



allDiff* 



/ [Xl,l: 

[1, 
[1, 



0, 0] \ 
0, 

^3,3,-^3,4 
^4,3, X4,4 



1, ^5,3,^5,4] / 



where 8 = {X12, ■ ■ ■ , X\ : . 



0,X 2 



,X, 



0}. 



When no further simplification rules apply the allDiff constraint is decom- 
posed to the corresponding cardinality constraints on the columns of its bit matrix 
representation. 



6 Constraint simplification in the VMTL example 

Consider again the VMTL example and the constraints from Figure [3] We focus on 
three constraints and follow the steps made when compiling these (we write "14" 
as short for [1, 1, . . . , 1]). 

14 

(1) int_array_plus([V 4 ,E 4 ], 14) 

(2) allDif f ([Vi, V a , V 3 , V 4 , Ei, E 2 , E 3 , E 4 ]), 

(3) int_array_plus([V 3 ,E 2 ,E 3 ,E 4 ], 14), 

In the first steps, constraint (1) is decomposed to an int_plus constraint which 
has the same form as the constraint in Example [TJ So, we have the bindings 
V 4 = [1, 1, 1, 1, 1, 1, V 4j7 , V 4 , 8 ] and E 4 = [1, 1, 1, 1, 1, 1, ->V 4i8 , ->V 4i7 ]. Now, consider the 
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allDif f constraint (2). BEE determines that this constraint is about permutation 
(8 integer variables with 8 different values in the range [1,8]). The simplification 
rules for allDif f detect that {V 4 , E 4 } must take together the two values 6 and 8 (us- 
ing a simplification rule similar to neq 2 ) triggerring the substitution {V 4 . 7 = V 4i8 }. 
Now rule allDif fi detects a Hall set {V 4 ,E 4 } of size two: 

allDif f ([Vi, V a , V 3j V 4 , E u E 2 , E 3 , E 4 ]) A allDif f ([Vj, V 2 , V 3 , E 1; E 2 , E 3 ]) 

where 9 is the unification that imposes Vi, V 2 , V 3 , Ei, E 2 , E 3 ^ 6,8. So we have the 
following bindings (where the impact of 9 is underlined) : 

Vi = [l,Vi,2,Vi.3,Vi.4,Vi,5, Vi.7,Vi.7 ,0] Ej = [1, E li2 , E 1>3 , E 1|4 , E^g, E t , 7 , E li7 , 0] 

V 2 = [1, V 2 , 2 , V 2 , 3 , V 2 ,4, V 2 ,5, V 2 ,7, V 2 ,7, 0] E 2 = [l , E 2 ,2 , E 2 ,3 , E 2 ,4 , E 2 ,5 , E 2 ,7 , E 2 ,7 , 0] 

V 3 = [1, V 3 , 2 , V 3 , 3 , V 3 ,4, V 3 , 5 , V 3 , 7 , V 3 , 7 , 0] E 3 = [1, E 3 , 2 , E 3 , 3 , E 3 , 4 , E 3 , 5 , E 3 , 7 , E 3 , 7 , 0] 

V 4 = [1, 1,1,1, 1,1, V 4 ,7,V 4 ,7] E 4 = [1, 1, 1, 1, 1, 1, -.V 4 ,7, -V 4 , 7 ] 

Consider now the constraint (3). Equi-propagation (because of bounds) dictates 
that max(Vi) = max(V 2 ) = max(V 3 ) = 5, so this constraint then simplifies as follows: 



int_array_plus( [ 




int_array_plus([ 


[1,V 3 , 2 ,V 3 , 3 ,V 3 , 4 ,V 3 ,5, 0,0,0], 




[V 3 , 2 , V 3 , 3 , V 3 ,4, V 3 ,5], 


[1,E 2 , 2 ,E 2 , 3 ,E 2 ,4,E 2 ,5, 0,0,0], 


1 — > 


[E 2 , 2 , E 2 , 3 , E 2 ,4, E 2 ,5], 


[1, E 3j2 , E 3 , 3 , E 3 , 4 , E 3 , 5 , 0, 0, 0], 




[E 3 ,2, E 3 , 3 , E 3 , 4 , E 3 , 5 ], 


[1,1, 1,1,1, 1,^V 4 ,7,^V 4 ,7], 14]) 




hV 4 ,7,-V 4 ,7], 5]) 



After applying simplification and decomposition rules on all the constraints from 
Figure [5] until no further rules can be applyed, the constraints will be encoded to 
CNF. The generated CNF contains 301 clauses and 48 Boolean variables. Compiling 
the same set of constraints from Figure [5] without applying simplification rules 
generates a larger CNF which contains 642 clauses and 97 Boolean variables. 



7 Another Example BEE Application: DNA word problem 

The DNA word problem (Problem 033 of CSPLib) seeks the largest parameter 
n, such that there exists a set S oi n eight-letter words over the alphabet £ = 
{A, C, G, T} with the following properties: (1) Each word in 5* has exactly 4 symbols 
from {C, G}; (2) Each pair of distinct words in S differ in at least 4 positions; and 
(3) For every x,y £ S: x R (the reverse of x) and y (the word obtained by replacing 
each A by T, each C by G, and vice versa) differ in at least 4 positions. 

In (jFrutos et al. 1997]) , the authors present a strategy to solve this problem where 
the four letters are modeled by bit-pairs (t,m). Each eight-letter word can then be 
viewed as the combination of a "t-part", (£1, . . . ,is), which is a bit-vector, and 
a "m-part", {m\, . . . ,m%), also a bit- vector. Building on the approach described 
in (jFrutos et al. 199"7j) . we pose conditions on sets of "t-parts" and "m-parts", T 
and M, so that their Cartesian product S = T x M will satisfy the requirements 
of the original problem. From the three conditions below, T is required to satisfy 
(1') and (2'), and M is required to satisfy (2') and (3'). For a set of bit- vectors V, 
the conditions are: (1') Each bit-vector in V sums to 4; (2') Each pair of distinct 
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bit-vectors in V differ in at least 4 positions; and (3') For each pair of bit- vectors 
(not necessarily distinct) u, v € V, u R (the reverse of u) and v (the complement 
of v) differ in at least 4 positions. This is equivalent to requiring that (u r ) c differs 
from v in at least 4 positions. 

It is this strategy that we model in our BEE encoding. An instance takes the 
form dna(ni, n 2 ) signifying the numbers of bit-vectors, n\ and ni in the sets T and 
M. Without loss of generality, we impose, to remove symmetries, that T and M 
are lexicographically ordered. A solution is the Cartesian product S = T x M. In 
Section IH1 we report that using BEE enables us to solve interesting instances of the 
problem not previously solvable by other techniques. 



8 Implementation 

BEE is implemented in (SWI) Prolog and can be applied in conjunction with the 
CryptoMiniSAT solver (jSoos 2010|) through a Prolog interface (|Codish et al. 2008)1 . 
BEE can be downloaded from (jMetodi 2012|) where one can find also the examples 
from this paper and others. The distribution includes also a solver, which we call 
BumbleBEE, which enables to specify a BEE model as an input file and solve it. 
The output is a set of bindings to the declared variables in the model. 

In BEE, Boolean variables are represented as Prolog variables. The negation of X 
is represented as -X. The truth values, true and false, are denoted 1 and -1. Integer 
variables (including negative range values) are represented in the order-encoding. 
When processing (bit-blasting) a declaration new_int(I, Min, Max), Prolog variable I 
is unified with the tuple (Min, Max, Bits , LastBit) where Min and Max are constants 
indicating the interval domain of I, Bits is a list of (Max — Min) variables, and 
LastBit is the last variable of Bits. This representation is more concise than the 
one assumed for simplicity in the previous sections and it also supports negative 
numbers. Maintaining direct access to the last bit in the representation (we already 
can access the first bit through the list Bits) facilitates a (constant time) check 
if the lower and upper bound values of a variable has changed. This way we can 
more efficiently determine when (certain) simplification rules apply. We make a few 
notes: (1) Integer variables must be declared before use; (2) BEE allows the use of 
constants in constraints instead of declaring them as integer variables (for example 
int_gt(l,5) represents a declaration new_int(l', 5, 5) together with the constraint 
int_gt(I, I')); (3) integer variables can be negated. 

BEE maintains constraints as a Prolog list (of terms). Each type of constraint is 
associated with corresponding rules for simplification, decomposition, and encod- 
ing to CNF. After bit-blasting, constraints are first simplified (equi-propagation 
and partial evaluation) using these rules until no further rules apply. During this 
process, if a pair of literals is equated (e.g. as in X=Y, X=-Y, X=l, X=-l), then 
they are unified, thus propagating the effect to other constraints. After constraint 
simplification, some constraints are decomposed, and this process repeats. We end 
up with a set of "basic" constraints (which cannot be further decomposed or sim- 
plified). These are then encoded to CNF. 
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instance BEE (dual encoding) BEE (order encoding) Sugar 







comp 


clauses 


vars 


sat 


comp 


clauses 


vars 


sat 


clauses 


vars 


sat 


25-264-0 


sat 


0.23 


6509 


1317 


0.33 


0.36 


33224 


887 


8.95 


126733 


10770 


34.20 


25-264-1 


sat 


0.20 


7475 


1508 


3.29 


0.30 


34323 


917 


97.50 


127222 


10798 


13.93 


25-264-2 


sat 


0.21 


6531 


1329 


0.07 


0.30 


35238 


905 


2.46 


127062 


10787 


8.06 


25-264-3 


sat 


0.21 


6819 


1374 


0.83 


0.29 


32457 


899 


18.52 


127757 


10827 


44.03 


25-264-4 


sat 


0.21 


7082 


1431 


0.34 


0.29 


32825 


897 


19.08 


126777 


10779 


85.92 


25-264-5 


sat 


0.21 


7055 


1431 


3.12 


0.30 


33590 


897 


46.15 


126973 


10784 


41.04 


25-264-6 


sat 


0.21 


7712 


1551 


0.34 


0.33 


39015 


932 


69.81 


128354 


10850 


12.67 


25-264-7 


sat 


0.21 


7428 


1496 


0.13 


0.30 


36580 


937 


19.93 


127106 


10794 


7.01 


25-264-8 


sat 


0.21 


6603 


1335 


0.18 


0.27 


31561 


896 


10.32 


124153 


10687 


9.69 


25-264-9 


sat 


0.21 


6784 


1350 


0.19 


0.27 


35404 


903 


34.08 


128423 


10853 


38.80 


25-264-10 


unsat 


0.21 


6491 


1296 


0.04 


0.30 


33321 


930 


10.92 


126999 


10785 


57.75 


25-264-11 


unsat 


0.12 


1 





0.00 


0.28 


37912 


955 


0.09 


125373 


10744 


0.47 


25-264-12 


unsat 


0.16 


1 





0.00 


0.29 


39135 


984 


0.08 


127539 


10815 


0.57 


25-264-13 


unsat 


0.12 


1 





0.00 


0.29 


35048 


944 


0.09 


127026 


10786 


0.56 


25-264-14 


unsat 


0.23 


5984 


1210 


0.07 


0.28 


31093 


885 


11.60 


126628 


10771 


15.93 


Total 










8.93 | 






349.58 






370.63 



Tabic 2. QCP results for 25 x 25 instances with 264 holes 

9 Experiments 

We report on our experience in applying BEE. To appreciate the ease in its use, 
and for further details, the reader is encouraged to view the example encodings 
available with the tool ([Metodi 2012[) . All experiments run on an Intel Core 2 
Duo E8400 3.00GHz CPU with 4GB memory under Linux (Ubuntu lucid, kernel 
2.6.32-24-gcneric). BEE is written in Prolog and run using SWI Prolog v6.0.2 64- 
bits. Comparisons with Sugar (vl.15.0) are based on the use of identical constraint 
models, apply the same SAT solver (CryptoMiniSat v2.5.1), and run on the same 
machine. For all of the tables describing experiments, columns indicate: 

comp: compile time (seconds) vars: number of CNF variables 

clauses: number of CNF clauses sat: SAT solving time (seconds) 

We first focus on the impact of the dual representation for allDif f constraints. 
We report on the application of BEE to Quasi-group completion problems (QCP), 



proposed by Gomes et al. (1997) as a constraint satisfaction benchmark, where the 



model is a conjunction of allDif f constraints. 

Quasi-group completion: We consider 15 instances from the 2008 CSP competi- 
tion^]. Table [2] considers three settings: BEE with its dual encoding for allDiff 
constraints, BEE using only the order encoding (equivalent to using int_neq con- 
straints instead of allDif f), and Sugar. The results indicate that: (1) Application 
of BEE using the dual representation for allDif f is 38 times faster and produces 
20 times less clauses (in average) than when using the order-encoding alone (de- 
spite the need to maintain two encodings); (2) Without the dual representation, 
solving encodings generated by BEE is only slightly faster but BEE generates CNF 



1 |http : //www . cril . univ-artois . f r/CPAI08/ 
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if 8 | k | BEE Sugar 



K w | k | BEE Sugar 



k | BEE Sugar 



143 


1.26 


2.87 


277 


5.31 


9.25 


267 


88.51 


175.70 


142 


10.14 


1.62 


276 


7.11 


9.91 


266 


229.80 


247.56 


141 


7.64 


2.94 


275 


13.57 


19.63 


265 


1335.31 


259.45 


140 


14.68 


6.46 


274 


4.93 


9.24 


264 


486.09 


513.61 


139 


25.60 


6.67 


273 


45.94 


9.03 


263 


236.68 


648.43 


138 


12.99 


2.80 


272 


22.74 


86.45 


262 


1843.70 


6429.25 


137 


22.91 


298.58 


271 


7.35 


9.49 


261 


2771.60 


7872.76 


136 


14.46 


251.82 


270 


6.03 


55.94 


260 


4873.99 


oo 


135 


298.54 


182.90 


269 


5.20 


11.05 


259 


oo 


oo 


134 


331.80 


oo 


268 


94.44 


424.89 


258 


oo 


oo 



Average 



clauses X 1000 I 248 I 402 

vars 5688 9370 



Average 



clauses xlOOO 
vars 



1229 1966 
15529 25688 



Tabic 3. VMTL results for K$ and Kiq (times are in seconds) 



encodings 4 times smaller (on average) than those generated by Sugar. Observe 
that 3 instances are found unsatisfiable by BEE (indicated by a CNF with a single 
clause and no variables). We comment that Sugar preprocessing times are higher 
than those of BEE and not indicated in the table. 

To further appreciate the impact of the tool we describe results for three ad- 
ditional applications which shift the state-of-the-art with respect to what could 
previously be solved. The experiments clearly illustrate that BEE decreases the size 
of CNF encodings as well as the subsequent SAT solving time. 

Magic labels: In ( jMacDougall et al. 2002[ ) the authors conjecture that the n vertex 
complete graph, K„ , for n > 5 has a vertex magic total labeling with magic con- 
stants for specific range of values of k, determined by n. This conjecture is proved 
correct for all odd n and verified by brute force for n = 6. We address the cases for 
n = 8 and n = 10 which involve 15 instances (different values of K) for n = 8, and 
23 (different values of K) for n = 10. Starting from the simple constraint model 
(illustrated by the example in Figure [5]), we add additional constraints to exploit 
that the graphs are symmetric: (1) We assume that the edge with the smallest 
label is ei^; (2) We assume that the labels of the edges incident to V\ are ordered 
and hence introduce constraints e± : 2 < < • • • < ei >n ; (3) We assume that the 
label of edge ei ; 3 is smaller than the labels of the edges incident to v 2 (except ei^) 
and introduce constraints accordingly. In this setting BEE can solve all except 2 
instances with a 4 hour timeout and Sugar can solve all except 4. 

Table [3] depicts results for the 10 hardest instances for K% and the 20 hardest 
for Kiq with a 4 hour time-out. BEE compilation times are on the order of 0.5 
sec/instance for Kg and 2.5 sec/instance for Kiq. Sugar encoding times are slightly 
larger. The instances are indicated by the magic constant, /c; the columns for BEE 
and Sugar indicate SAT solving times (in seconds). The bottom two lines indicate 
average encoding sizes (numbers of clauses and variables) . 

The results indicate that the Sugar encodings are (in average) about 60% larger, 
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while the average SAT solving time for the BEE encodings is about 2 times faster 
(average excluding instances where Sugar times-out). 

To address the two VMTL instances not solvable using the BEE models described 
above (JCio with magic labels 259 and 258), we partition the problem hxing the 
values of ex,2 and e± t 3 and maintaining all of the other constraints. Analysis of the 
symmetry breaking constraints indicates that this results in 198 new instances for 
each of the two cases. The original VMTL instance is solved if any one of of these 
198 instances is solved. So, we solve them in parallel. Fixing ei,2 and e± t 3 "fuels" 
the compiler so the encodings are considerably smaller. The instance for k = 259 
is solved in 1379.50 seconds where ei.2 = 1 and ei.3 = 6. The compilation time is 
2.09 seconds and the encoding consists in 1056107 clauses and 14143 variables. 

To the best of our knowledge, the hard instances from this suite are beyond the 
reach of all previous approaches to program the search for magic labels. The SAT 
based approach presented in ( |Jager 2010[ ) cannot handle thescH The comparison 
with Sugar indicates the impact of the compiler. 



DNA word problem: Mancini et al. (2008 1 provide a comparison of several state 



of-the-art solvers applied to the DNA word problem with a variety of encoding 
techniques. Their best reported result is a solution with 87 DNA words, obtained in 
554 seconds, using an OPL ( |van Hentenryck 1999[ ) model with lexicographic order 
to break symmetry. In (jFrutos et al. 1997]) . the authors report a solution composed 
from two pairs of (t-part and m-part) sets (Ti,M_) and (T2,M2) where |Ti| = 6, 
|M_| = 16, \T 2 \ = 2, |M 2 | = 6. This forms a set S with (6 x 16) + (2 x 6) = 108 
DNA words. Marc van Dongen reports a larger solution with 112 wordso Using 
BEE, we find, in a fraction of a second, a template of size 14 and a map of size 8. 
This provides a solution of size 14 x 8 = 112 to the DNA word problem. Running 
Comet (v2.0.1) we hnd a 112 word solution in about 10 seconds using a model by 
Hakan KjellerstrandQ We also prove that there docs not exist a template of size 15 
(0.15 seconds), nor a map of size 9 (4.47 seconds). These facts were unknown prior 
to BEE. Proving that there is no solution to the DNA word problem with more 
than 112 words, not via the two part t-m strategy, is still an open problem. 

Model Based Diagnostics (MBD) is an artificial intelligence based approach that 

aims to cope with the, so-called, diagnosis problem (e.g. (jReiter 1987|> ). In (jMetodi et al. 2012[) 

we (with other researchers) focus on a notion of minimal cardinality MBD and apply 

BEE to model and solve the instances of a standard MBD benchmark. Experimental 

evidence (see (Metodi et al. 2012]) ). indicates that our approach is superior to all 

existing algorithms for minimal cardinality MBD. We determine, for the first time, 

minimal cardinality diagnoses for the entire standard benchmark. Prior attempts to 



apply SAT for MBD (for example, by Smith et al. (2005 ) and Feldman et al. (2010) 



where a MaxSAT solver is used) indicate that SAT solvers perform poorly on the 
standard benchmarks. So, BEE really makes the difference. 



2 Personal communication (Ceroid Jager), March 2012. 

3 Sec http://www.cs.st-andrews.ac.uk/~iaim/CSPLlb71 

4 See http://www.hakank.org/comet/word_design_dnal.co 
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10 Conclusion 



We introduce BEE, a compiler to encode finite domain constraints to CNF. A key 
design point is to apply bit-level techniques, locally as prescribed by the word-level 
constraints in a model. Optimizations are based on equi-propagation and partial 
evaluation. Implemented in Prolog, compilation times are typically small (measured 
in seconds) even for instances which result in several millions of CNF clauses. On the 
other hand, the reduction in SAT solving time can be larger in orders of magnitude. 

It is well-understood that making a CNF smaller is not the ultimate goal: often 
smaller CNF's are harder to solve. Indeed, one often introduces redundancies to 
improve SAT encodings: so removing them is counter productive. Our experience 
is that BEE reduces the size of an encoding in a way that is productive for the sub- 
sequent SAT solving. In particular, by removing variables that can be determined 
"at compile time" to be definitely equal (or definitely different) in any solution. 

The simplification rules illustrated in Section [5] apply standard constraint pro- 
gramming techniques (i.e. to reduce variable domains). However, equi-propagation 
is more powerful. It focuses, in general, in specializing the bit-level representation 
of the constraints in view of equations implied by the constraints. In this way it 
captures many of the well-known constraint programming preprocessing techniques, 
and more. 

Future work will investigate: how to strengthen the implementation of equi- 
propagation using BDD's and SAT solving techniques, how to improve the compiler 
implementation using better data-structures for the constraint store (for example 
applying a CHR based approach for the simplification rules), and how to enhance 
the underlying constraint language. 
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